Overview

Dataset statistics

Number of variables31
Number of observations9082
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 MiB
Average record size in memory200.0 B

Variable types

BOOL15
NUM11
CAT5

Reproduction

Analysis started2021-04-21 15:32:21.363310
Analysis finished2021-04-21 15:33:43.867825
Duration1 minute and 22.5 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Provider Name has a high cardinality: 212 distinct values High cardinality
Provider City has a high cardinality: 100 distinct values High cardinality
Total Number of Occupied Beds is highly correlated with Number of All BedsHigh correlation
Number of All Beds is highly correlated with Total Number of Occupied BedsHigh correlation
Week Ending is uniformly distributed Uniform
Provider Name is uniformly distributed Uniform
Residents Total Admissions COVID-19 has 1980 (21.8%) zeros Zeros
Residents Total Confirmed COVID-19 has 1362 (15.0%) zeros Zeros
Residents Total Suspected COVID-19 has 2621 (28.9%) zeros Zeros
Residents Weekly All Deaths has 5784 (63.7%) zeros Zeros
Residents Total All Deaths has 361 (4.0%) zeros Zeros
Residents Total COVID-19 Deaths has 2087 (23.0%) zeros Zeros
Staff Total Confirmed COVID-19 has 827 (9.1%) zeros Zeros
Staff Weekly Suspected COVID-19 has 8549 (94.1%) zeros Zeros
Staff Total Suspected COVID-19 has 2797 (30.8%) zeros Zeros

Variables

Week Ending
Categorical

UNIFORM

Distinct count43
Unique (%)0.5%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
2020-06-14T00:00:00
 
212
2020-09-27T00:00:00
 
212
2020-09-20T00:00:00
 
212
2020-08-23T00:00:00
 
212
2020-07-19T00:00:00
 
212
Other values (38)
8022
ValueCountFrequency (%) 
2020-06-14T00:00:002122.3%
 
2020-09-27T00:00:002122.3%
 
2020-09-20T00:00:002122.3%
 
2020-08-23T00:00:002122.3%
 
2020-07-19T00:00:002122.3%
 
2020-06-28T00:00:002122.3%
 
2020-05-31T00:00:002122.3%
 
2020-05-24T00:00:002122.3%
 
2020-08-09T00:00:002122.3%
 
2020-09-13T00:00:002122.3%
 
Other values (33)696276.7%
 

Length

Max length19
Median length19
Mean length19
Min length19

Provider Name
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count212
Unique (%)2.3%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
ORANGE HEALTH CARE CENTER
 
43
GROVE MANOR NURSING HOME, INC
 
43
APPLE REHAB ROCKY HILL
 
43
CASSENA CARE AT STAMFORD
 
43
ABBOTT TERR HEALTH CTR
 
43
Other values (207)
8867
ValueCountFrequency (%) 
ORANGE HEALTH CARE CENTER430.5%
 
GROVE MANOR NURSING HOME, INC430.5%
 
APPLE REHAB ROCKY HILL430.5%
 
CASSENA CARE AT STAMFORD430.5%
 
ABBOTT TERR HEALTH CTR430.5%
 
NORTHBRIDGE HEALTH CARE CENTER430.5%
 
HAMDEN REHABILITATION & HEALTH CARE CENTER430.5%
 
VILLA MARIA NURSING & REHAB COMMUNITY, INC430.5%
 
AUTUMN LAKE HEALTHCARE AT NEW BRITAIN430.5%
 
ST JOSEPHS LIVING CENTER430.5%
 
Other values (202)865295.3%
 

Length

Max length50
Median length26
Mean length27.03501431
Min length7

Provider City
Categorical

HIGH CARDINALITY

Distinct count100
Unique (%)1.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
WATERBURY
 
334
MERIDEN
 
301
STAMFORD
 
215
NEW HAVEN
 
215
NEW BRITAIN
 
215
Other values (95)
7802
ValueCountFrequency (%) 
WATERBURY3343.7%
 
MERIDEN3013.3%
 
STAMFORD2152.4%
 
NEW HAVEN2152.4%
 
NEW BRITAIN2152.4%
 
WEST HARTFORD2152.4%
 
DANBURY2152.4%
 
BRISTOL1721.9%
 
MIDDLETOWN1721.9%
 
HARTFORD1721.9%
 
Other values (90)685675.5%
 

Length

Max length16
Median length9
Mean length8.759083902
Min length4

Residents Total Admissions COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count196
Unique (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.213279013433164
Minimum0
Maximum535
Zeros1980
Zeros (%)21.8%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median6
Q317
95-th percentile56
Maximum535
Range535
Interquartile range (IQR)16

Descriptive statistics

Standard deviation31.65158627
Coefficient of variation (CV)2.080523617
Kurtosis74.6270188
Mean15.21327901
Median Absolute Deviation (MAD)6
Skewness6.898305587
Sum138167
Variance1001.822914
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0198021.8%
 
19029.9%
 
24915.4%
 
44615.1%
 
73734.1%
 
103573.9%
 
63323.7%
 
53103.4%
 
32963.3%
 
82713.0%
 
Other values (186)330936.4%
 
ValueCountFrequency (%) 
0198021.8%
 
19029.9%
 
24915.4%
 
32963.3%
 
44615.1%
 
ValueCountFrequency (%) 
5351< 0.1%
 
5281< 0.1%
 
5211< 0.1%
 
5131< 0.1%
 
4941< 0.1%
 

Residents Total Confirmed COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count159
Unique (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.77659105923805
Minimum0
Maximum171
Zeros1362
Zeros (%)15.0%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median28
Q363
95-th percentile102
Maximum171
Range171
Interquartile range (IQR)59

Descriptive statistics

Standard deviation35.79671296
Coefficient of variation (CV)0.9475898156
Kurtosis0.06923640367
Mean37.77659106
Median Absolute Deviation (MAD)27
Skewness0.8418299912
Sum343087
Variance1281.404658
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0136215.0%
 
13994.4%
 
22352.6%
 
31972.2%
 
51902.1%
 
181651.8%
 
281631.8%
 
201611.8%
 
231441.6%
 
171321.5%
 
Other values (149)593465.3%
 
ValueCountFrequency (%) 
0136215.0%
 
13994.4%
 
22352.6%
 
31972.2%
 
41071.2%
 
ValueCountFrequency (%) 
1712< 0.1%
 
170150.2%
 
1691< 0.1%
 
1681< 0.1%
 
1671< 0.1%
 

Residents Total Suspected COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count107
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.439440651838803
Minimum0
Maximum293
Zeros2621
Zeros (%)28.9%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median3
Q318
95-th percentile69
Maximum293
Range293
Interquartile range (IQR)18

Descriptive statistics

Standard deviation32.80970818
Coefficient of variation (CV)2.125058085
Kurtosis30.82677992
Mean15.43944065
Median Absolute Deviation (MAD)3
Skewness4.812728346
Sum140221
Variance1076.476951
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0262128.9%
 
1107111.8%
 
25986.6%
 
35175.7%
 
42873.2%
 
62723.0%
 
52132.3%
 
202002.2%
 
71822.0%
 
211782.0%
 
Other values (97)294332.4%
 
ValueCountFrequency (%) 
0262128.9%
 
1107111.8%
 
25986.6%
 
35175.7%
 
42873.2%
 
ValueCountFrequency (%) 
293250.3%
 
292120.1%
 
2881< 0.1%
 
2861< 0.1%
 
2791< 0.1%
 

Residents Weekly All Deaths
Real number (ℝ≥0)

ZEROS

Distinct count44
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8313146883946267
Minimum0
Maximum71
Zeros5784
Zeros (%)63.7%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum71
Range71
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.777344239
Coefficient of variation (CV)3.34090601
Kurtosis175.2695731
Mean0.8313146884
Median Absolute Deviation (MAD)0
Skewness11.04748687
Sum7550
Variance7.71364102
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0578463.7%
 
1203322.4%
 
27248.0%
 
32232.5%
 
41041.1%
 
5340.4%
 
6240.3%
 
7240.3%
 
8140.2%
 
10130.1%
 
Other values (34)1051.2%
 
ValueCountFrequency (%) 
0578463.7%
 
1203322.4%
 
27248.0%
 
32232.5%
 
41041.1%
 
ValueCountFrequency (%) 
711< 0.1%
 
661< 0.1%
 
561< 0.1%
 
521< 0.1%
 
511< 0.1%
 

Residents Total All Deaths
Real number (ℝ≥0)

ZEROS

Distinct count138
Unique (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.088526756221096
Minimum0
Maximum196
Zeros361
Zeros (%)4.0%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q19
median19
Q332
95-th percentile61
Maximum196
Range196
Interquartile range (IQR)23

Descriptive statistics

Standard deviation20.54581936
Coefficient of variation (CV)0.8898713883
Kurtosis7.830760268
Mean23.08852676
Median Absolute Deviation (MAD)11
Skewness2.063673226
Sum209690
Variance422.1306931
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03614.0%
 
13433.8%
 
192993.3%
 
32943.2%
 
152542.8%
 
102482.7%
 
22382.6%
 
212372.6%
 
92302.5%
 
112292.5%
 
Other values (128)634969.9%
 
ValueCountFrequency (%) 
03614.0%
 
13433.8%
 
22382.6%
 
32943.2%
 
42202.4%
 
ValueCountFrequency (%) 
1961< 0.1%
 
1921< 0.1%
 
1912< 0.1%
 
1861< 0.1%
 
1851< 0.1%
 

Residents Total COVID-19 Deaths
Real number (ℝ≥0)

ZEROS

Distinct count65
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.087976216692358
Minimum0
Maximum84
Zeros2087
Zeros (%)23.0%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median7
Q316
95-th percentile29
Maximum84
Range84
Interquartile range (IQR)15

Descriptive statistics

Standard deviation10.92314658
Coefficient of variation (CV)1.082788693
Kurtosis5.562729679
Mean10.08797622
Median Absolute Deviation (MAD)7
Skewness1.81117525
Sum91619
Variance119.3151313
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0208723.0%
 
15606.2%
 
74985.5%
 
23654.0%
 
53594.0%
 
63493.8%
 
133423.8%
 
43283.6%
 
93263.6%
 
82993.3%
 
Other values (55)356939.3%
 
ValueCountFrequency (%) 
0208723.0%
 
15606.2%
 
23654.0%
 
31721.9%
 
43283.6%
 
ValueCountFrequency (%) 
8450.1%
 
833< 0.1%
 
813< 0.1%
 
782< 0.1%
 
761< 0.1%
 

Number of All Beds
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count92
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116.31424796300374
Minimum0
Maximum360
Zeros3
Zeros (%)< 0.1%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile45
Q176
median120
Q3137
95-th percentile217
Maximum360
Range360
Interquartile range (IQR)61

Descriptive statistics

Standard deviation55.49257174
Coefficient of variation (CV)0.4770917812
Kurtosis4.211328111
Mean116.314248
Median Absolute Deviation (MAD)30
Skewness1.533746068
Sum1056366
Variance3079.425519
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
120132814.6%
 
607988.8%
 
905886.5%
 
1504875.4%
 
1304054.5%
 
752582.8%
 
1282152.4%
 
1602152.4%
 
1801721.9%
 
1261721.9%
 
Other values (82)444448.9%
 
ValueCountFrequency (%) 
03< 0.1%
 
23430.5%
 
25430.5%
 
30860.9%
 
35860.9%
 
ValueCountFrequency (%) 
360430.5%
 
357430.5%
 
345430.5%
 
294430.5%
 
282430.5%
 

Total Number of Occupied Beds
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count277
Unique (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean85.05450341334507
Minimum0
Maximum901
Zeros34
Zeros (%)0.4%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile29
Q156
median81
Q3104
95-th percentile159
Maximum901
Range901
Interquartile range (IQR)48

Descriptive statistics

Standard deviation42.85129212
Coefficient of variation (CV)0.5038097973
Kurtosis18.40884244
Mean85.05450341
Median Absolute Deviation (MAD)24
Skewness2.139015977
Sum772465
Variance1836.233237
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
811842.0%
 
771361.5%
 
781351.5%
 
821311.4%
 
831311.4%
 
801251.4%
 
841241.4%
 
791241.4%
 
751201.3%
 
851171.3%
 
Other values (267)775585.4%
 
ValueCountFrequency (%) 
0340.4%
 
14< 0.1%
 
21< 0.1%
 
41< 0.1%
 
52< 0.1%
 
ValueCountFrequency (%) 
9011< 0.1%
 
3231< 0.1%
 
3171< 0.1%
 
3142< 0.1%
 
3131< 0.1%
 
Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
0
6708
1
 
1251
2
 
456
3
 
237
4
 
98
Other values (42)
 
332
ValueCountFrequency (%) 
0670873.9%
 
1125113.8%
 
24565.0%
 
32372.6%
 
4981.1%
 
6550.6%
 
5510.6%
 
7360.4%
 
8310.3%
 
9190.2%
 
Other values (37)1401.5%
 

Length

Max length2
Median length1
Mean length1.015415107
Min length1

Staff Total Confirmed COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count118
Unique (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.67022682228584
Minimum0
Maximum130
Zeros827
Zeros (%)9.1%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median15
Q331
95-th percentile60
Maximum130
Range130
Interquartile range (IQR)27

Descriptive statistics

Standard deviation20.46763429
Coefficient of variation (CV)0.990198824
Kurtosis3.471336276
Mean20.67022682
Median Absolute Deviation (MAD)12
Skewness1.597035582
Sum187727
Variance418.9240534
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
08279.1%
 
15365.9%
 
23784.2%
 
33293.6%
 
52813.1%
 
42743.0%
 
82723.0%
 
232202.4%
 
142182.4%
 
152162.4%
 
Other values (108)553160.9%
 
ValueCountFrequency (%) 
08279.1%
 
15365.9%
 
23784.2%
 
33293.6%
 
42743.0%
 
ValueCountFrequency (%) 
13050.1%
 
1261< 0.1%
 
1231< 0.1%
 
11770.1%
 
116110.1%
 

Staff Weekly Suspected COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count36
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2176833296630698
Minimum0
Maximum53
Zeros8549
Zeros (%)94.1%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum53
Range53
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.908507433
Coefficient of variation (CV)8.767356859
Kurtosis336.3682511
Mean0.2176833297
Median Absolute Deviation (MAD)0
Skewness16.69552629
Sum1977
Variance3.642400623
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0854994.1%
 
12843.1%
 
21081.2%
 
3410.5%
 
4220.2%
 
5100.1%
 
690.1%
 
870.1%
 
1050.1%
 
74< 0.1%
 
Other values (26)430.5%
 
ValueCountFrequency (%) 
0854994.1%
 
12843.1%
 
21081.2%
 
3410.5%
 
4220.2%
 
ValueCountFrequency (%) 
532< 0.1%
 
481< 0.1%
 
421< 0.1%
 
411< 0.1%
 
391< 0.1%
 

Staff Total Suspected COVID-19
Real number (ℝ≥0)

ZEROS

Distinct count68
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.109997797841885
Minimum0
Maximum80
Zeros2797
Zeros (%)30.8%
Memory size35.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q310
95-th percentile36
Maximum80
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation13.26590612
Coefficient of variation (CV)1.635747191
Kurtosis6.024255523
Mean8.109997798
Median Absolute Deviation (MAD)2
Skewness2.412339573
Sum73655
Variance175.9842652
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0279730.8%
 
1105111.6%
 
28799.7%
 
36327.0%
 
53303.6%
 
102713.0%
 
42683.0%
 
62112.3%
 
82052.3%
 
92012.2%
 
Other values (58)223724.6%
 
ValueCountFrequency (%) 
0279730.8%
 
1105111.6%
 
28799.7%
 
36327.0%
 
42683.0%
 
ValueCountFrequency (%) 
80170.2%
 
68170.2%
 
66190.2%
 
651< 0.1%
 
63180.2%
 
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
0
8505
1
 
534
2
 
43
ValueCountFrequency (%) 
0850593.6%
 
15345.9%
 
2430.5%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
N
8574
Y
 
508
ValueCountFrequency (%) 
N857494.4%
 
Y5085.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
N
8880
Y
 
202
ValueCountFrequency (%) 
N888097.8%
 
Y2022.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
N
8531
Y
 
551
ValueCountFrequency (%) 
N853193.9%
 
Y5516.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
N
8802
Y
 
280
ValueCountFrequency (%) 
N880296.9%
 
Y2803.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8670
N
 
412
ValueCountFrequency (%) 
Y867095.5%
 
N4124.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
7797
N
 
1285
ValueCountFrequency (%) 
Y779785.9%
 
N128514.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8681
N
 
401
ValueCountFrequency (%) 
Y868195.6%
 
N4014.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8618
N
 
464
ValueCountFrequency (%) 
Y861894.9%
 
N4645.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8682
N
 
400
ValueCountFrequency (%) 
Y868295.6%
 
N4004.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8638
N
 
444
ValueCountFrequency (%) 
Y863895.1%
 
N4444.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
8380
N
 
702
ValueCountFrequency (%) 
Y838092.3%
 
N7027.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
9025
N
 
57
ValueCountFrequency (%) 
Y902599.4%
 
N570.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
N
8519
Y
 
563
ValueCountFrequency (%) 
N851993.8%
 
Y5636.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
9060
N
 
22
ValueCountFrequency (%) 
Y906099.8%
 
N220.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
Y
9047
N
 
35
ValueCountFrequency (%) 
Y904799.6%
 
N350.4%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

Week EndingProvider NameProvider CityResidents Total Admissions COVID-19Residents Total Confirmed COVID-19Residents Total Suspected COVID-19Residents Weekly All DeathsResidents Total All DeathsResidents Total COVID-19 DeathsNumber of All BedsTotal Number of Occupied BedsStaff Weekly Confirmed COVID-19Staff Total Confirmed COVID-19Staff Weekly Suspected COVID-19Staff Total Suspected COVID-19Staff Total COVID-19 DeathsShortage of Nursing StaffShortage of Clinical StaffShortage of AidesShortage of Other StaffAny Current Supply of N95 MasksOne-Week Supply of N95 MasksAny Current Supply of Surgical MasksOne-Week Supply of Surgical MasksAny Current Supply of Eye ProtectionOne-Week Supply of Eye ProtectionOne-Week Supply of GownsAny Current Supply of Hand SanitizerThree or More Confirmed COVID-19 Cases This Week or Initial Confirmed COVID-19 Case this WeekAble to Test or Obtain Resources to Test All Current Residents Within Next 7 DaysAble to Test or Obtain Resources to Test All Staff and/or Personnel Within Next 7 Days
02020-05-24T00:00:00NORTHBRIDGE HEALTH CARE CENTERBRIDGEPORT25000000011000NNNNYYYYYYYYNYY
12020-05-24T00:00:00ORCHARD GROVE SPECIALTY CARE CENTER, LLCUNCASVILLE6283019712081015020NNNNYYYYYYYYNYY
22020-07-19T00:00:00WATROUS NURSING CENTERMADISON011000452901000NNNNYYYYYYYYNYY
32020-06-28T00:00:00APPLE REHAB SHELTON LAKESSHELTON1001201067300000NNNNYYYYYYYYNYY
42020-09-13T00:00:00REGALCARE AT GREENWICHGREENWICH1112002814756207000NNNNYYYYYYYYNYY
52020-11-15T00:00:00WITHERELL, NATHANIELGREENWICH161301401616202152111000NNNNYYYYYYYYNYY
62020-12-27T00:00:00AVERY NURSING HOMEHARTFORD25821139211991411920110NNNNYYYYYYYYYYY
72020-08-30T00:00:00REGALCARE AT GREENWICHGREENWICH1112002814756407000NNNNYYYYYYYYNYY
82020-10-11T00:00:00GROVE MANOR NURSING HOME, INCWATERBURY2290021604317000YYYYYYYYYYYYNYY
92021-01-31T00:00:00WINDSOR HEALTH AND REHABILITATION CENTER, LLCWINDSOR781034010888018000YNYYYYYYYYYYNYY

Last rows

Week EndingProvider NameProvider CityResidents Total Admissions COVID-19Residents Total Confirmed COVID-19Residents Total Suspected COVID-19Residents Weekly All DeathsResidents Total All DeathsResidents Total COVID-19 DeathsNumber of All BedsTotal Number of Occupied BedsStaff Weekly Confirmed COVID-19Staff Total Confirmed COVID-19Staff Weekly Suspected COVID-19Staff Total Suspected COVID-19Staff Total COVID-19 DeathsShortage of Nursing StaffShortage of Clinical StaffShortage of AidesShortage of Other StaffAny Current Supply of N95 MasksOne-Week Supply of N95 MasksAny Current Supply of Surgical MasksOne-Week Supply of Surgical MasksAny Current Supply of Eye ProtectionOne-Week Supply of Eye ProtectionOne-Week Supply of GownsAny Current Supply of Hand SanitizerThree or More Confirmed COVID-19 Cases This Week or Initial Confirmed COVID-19 Case this WeekAble to Test or Obtain Resources to Test All Current Residents Within Next 7 DaysAble to Test or Obtain Resources to Test All Staff and/or Personnel Within Next 7 Days
90722020-08-16T00:00:00SOUTHINGTON CARE CENTERSOUTHINGTON145912115913097031090NNNNYYYYYYYYNYY
90732021-01-17T00:00:00APPLE REHAB COLCHESTERCOLCHESTER04500426037033000NNNNYYYYYYYYNYY
90742020-07-12T00:00:00AVON HEALTH CENTERAVON550750327120920120320NNNNYYYYYYYYNYY
90752020-12-20T00:00:00TOUCHPOINTS AT BLOOMFIELDBLOOMFIELD641022020151501180210591NNNNYNYYYYYYNYY
90762020-11-01T00:00:00WATROUS NURSING CENTERMADISON011000452701000NNNNYYYYYYYYNYY
90772020-08-23T00:00:00NOTRE DAME CONVALESCENT HOME INORWALK322380171360450200130NNNNYYYYYYYYNYY
90782021-01-24T00:00:00LEDGECREST HEALTH CAREKENSINGTON626101555548012000NNNNYYYYYYYYNYY
90792021-01-31T00:00:00JOHN L. LEVITOW HEALTH CARE CENTERROCKY HILL12340020712581242030NNNNYYYYYYYYNYY
90802020-12-06T00:00:00APPLE REHAB SHELTON LAKESSHELTON821011001067924000NNNNYYYYYYYYYYY
90812020-08-23T00:00:00BETHEL HEALTH CARE CENTERBETHEL236166044161611190300191NNNNYYYYYYYYNYY